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ABSTRACT 

We derive and interpret some relations between the luminosity, mass, and 
age distributions of star clusters, denoted here by <fi(L), ip(M), and x( r )) respec- 
tively Of these, <f)(L) is the easiest to determine observationally, whereas ip(M) 
and x( r ) are more informative about formation and disruption processes. For a 
population of young clusters, with a relatively wide range of ages, (fi(L) depends 
on both ip(M) and x{ T ) an d thus cannot serve as a proxy for ip(M) in general. 
We demonstrate this explicitly by four illustrative examples with specific forms 
for either ip(M) or x( r )- 111 the special case in which ip(M) is a power law and is 
independent of x( r )> however, <j)(L) is also a power law with the same exponent 
as ip(M). We conclude that this accounts for the observed similarity between 
<f)(L) and ip(M) for the young clusters in the Antennae galaxies. This result 
reinforces our picture in which clusters form with ip(M) oc M~ 2 and are then 
disrupted rapidly at a rate roughly independent of their masses. The most likely 
disruptive process in this first stage is the removal of interstellar matter by the 
energy and momentum input from young stars (by photoionization, winds, jets, 
and supernovae). The few clusters that avoid this "infant mortality" are eventu- 
ally disrupted in a second stage by the evaporation of stars driven by two-body 
relaxation, a process with a strong dependence on mass. We suspect this picture 
may apply to many, if not all, populations of star clusters, but this needs to be 
verified observationally by determinations of ijj(M) and x{ T ) m more galaxies. 

Subject headings: galaxies: star clusters — stars: formation 



1. INTRODUCTION 



In this paper, we derive and interpret some relations between the luminosity, mass, 
and age distributions of young star clusters. We define these "distributions" (probability 
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densities, in fact) as follows: 4>(L)dL is the fraction of clusters with luminosities between L 
and L + dL, ip(M)dM is the fraction of clusters with masses between M and M + dM, and 
x(T)dr is the fraction of clusters with ages between r and r + dr. We focus here on the 
question: To what extent does (/>(L) reflect i/)(M), and how does this depend on x(r)? 

For a population of clusters with the same mass-to-light ratio, resulting from a common 
age and stellar mass function (and neglecting any mass dependence of stellar escape rates), 
(f>(L) always has the same form as ip(M). This assumption is often made in studies of old 
globular clusters, when their colors or other evidence indicates that the spread in ages is 
modest or narrow (in a fractional sense). However, for a population of clusters with a wide 
range of mass-to-light ratios, <j)(L) can be very different from ip(M). This is the generic 
case when the spread in ages is broad in fractional terms, as happens when clusters form 
continually up to the present time. We are concerned in this paper primarily with this 
situation. 

The luminosity function of a population of clusters is relatively easy to determine be- 
cause it requires observations in only one photometric band. Consequently, there is now an 
extensive literature on this subject. In studies with deep observations of large samples of 
young clusters, the luminosity function is often found to have an approximate power-law 
form, <f>(L) oc L a , with an exponent near a ~ —2. Early examples of this result include the 
Milky Way (van den Bergh & LaFontaine 1984), the Large Magellanic Cloud (Elson & Fall 
1985), and Messier 33 (Christian & Schommer 1988). A more recent example, and the one 
that motivates the work presented here, is the interacting Antennae galaxies (Whitmore et 
al. 1999). In some cases, the luminosity function is a power law in a first approximation, but 
also appears to have some secondary convex curvature in log-log plots. However, this curva- 
ture often has only marginal statistical significance, and conceivably it could be the result of 
subtle systematic effects, such as under correct ions for incompleteness or photometric errors 
at faint magnitudes. We ignore such curvature here in the interest of keeping our analysis 
as simple and transparent as possible. 

Because the luminosity function is affected by the fading of the stellar populations within 
the clusters, it does not tell us directly about the formation and disruption of the clusters. 
The mass and age distributions are more fundamental in this regard. In particular, for the 
youngest clusters, ip(M) is likely to reflect fairly directly the physical processes involved in 
the formation of the clusters and/or their parent molecular clouds. The initial mass function 
of clusters thus plays a role in the theory of cluster formation similar to that of the initial 
mass function of stars in the theory of star formation. The age distribution, in principle, 
reflects a combination of the formation and disruption histories of the clusters. In practice, 
however, x( T ) is primarily a diagnostic of disruption processes, because it usually has a 
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stronger dependence on r than is plausible for the formation history. 

The mass and age distributions require photometry in several bands, corrections for 
interstellar redding, and comparisons with stellar population models. This makes their 
determination more laborious than that of the luminosity function. Moreover, estimating 
two univariate distributions or one bivariate distribution (of M and r) to a given level of 
statistical accuracy requires a larger sample of clusters than estimating a single univariate 
distribution (of L) to the same accuracy. Consequently, we have reliable determinations of 
the mass and age distributions of clusters in only a few galaxies, the best example again 
being the Antennae. In this case, the mass and age distributions can be represented by the 
power laws ip(M) oc M 13 with (3 « -2 (Zhang & Fall 1999) and x(r) oc r 7 with 7 « -1 
(Fall, Chandar, & Whitmore 2005). The exponent of x( T ) quoted here is for mass-limited 
samples, as is required in all the formulae of this paper; for luminosity-limited samples, x{ T ) 
is significantly steeper, as a consequence of the rapid fading of the clusters. Furthermore, 
over the observed domain of masses and ages, ip(M) and xi T ) are approximately independent 
of each other (as shown in the previous references). This fact simplifies much of the analysis 
in this paper. 

The observations of the Antennae galaxies raise an interesting question: Why do the 
luminosity and mass functions of the young clusters have the same, or at least approximately 
the same form? i.e., why are they both power laws with the same exponent? In much of the 
literature on star clusters, there is a tendency to regard the luminosity function as a proxy 
for the mass function. As noted above, this cannot be true in general, given that clusters of 
different ages have different mass-to-light ratios. Yet the evidence from the Antennae galaxies 
suggests that <f>(L) and ip(M) are in fact closely related. Why is this? Answering this question 
is the main purpose of this paper. Although our analysis is motivated by observations of the 
Antennae galaxies, it seems likely on theoretical grounds that our conclusions have wider 
applicability, possibly to most or even all galaxies with large populations of young clusters. 

2. GENERAL RELATIONS 

The general relation between the luminosity and mass functions of star clusters must 
also involve the age distribution. With this in mind, we introduce the following bivariate 
distributions: f(L, r)dLdr is the fraction of clusters with luminosities between L and L + dL 
and ages between r and r + dr, and g(M, r)dMdr is the fraction of clusters with masses 
between M and M + dM and ages between r and r + dr. In terms of these distributions, 
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the univariate distributions of luminosity, mass, and age are 

POO 

4>(L)= / f(L,r)dr, (1) 
Jo 

poo 

V>(M)= / ^(M,r)rfr, (2) 

JO 

poo poo 

X (r)= f(L,r)dL= g(M,r)dM. (3) 
Jo Jo 

We now make our first simplifying assumption, that the mass-to-light ratios of clusters, 
denoted by ji, depend only on their ages: 

[i(t)=M(t)/L(M,t). (4) 

In principle, fi could vary among clusters of the same age if, for example, they had different 
stellar mass functions. In practice, such variations are found or assumed to be small, and 
equation (4) is thus the basis for all determinations of the mass functions of clusters from 
multiband photometry. In this case, the bivariate distributions are related by 

f(L,T)=g(M,T)\(dM/dL) T \=g(M,T)»(T). (5) 

Combining equations (1), (4), and (5) yields 

roo 

4>{L)= / gW)L,r]^r)dr. (6) 
Jo 

Given g(M, r), equations (2), (3), and (6) determine fully the relations between 4>(L), ip(M), 
and xij)- These relations in general are quite complicated. 

We now make our second simplifying assumption, that the mass and age distributions 
of clusters are independent of each other: 

g(M,r)=^M)x(r). (7) 

This is valid if the rates of formation and disruption of the clusters are independent of their 
masses. For very young clusters, which dominate the bright parts of <f>(L), the main disruptive 
effect is loss of interstellar matter by the energy and momentum input from young stars 
(photoionization, stellar winds and jets, and supernovae remnants). These inputs should be 
roughly proportional to the masses of the clusters and hence also to the amount of material to 
be removed. Thus, we expect the disruption rate to be roughly independent of the masses of 
the clusters, as prescribed by equation (7) above (see Fall et al. 2005 for further discussion of 
this and related issues). This is consistent with our observations of the young clusters in the 
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Antennae galaxies, where we have determined g(M, r) for r < 1O 7 (M/1O 4 M ) L3 yr. Once the 
evaporation of stars by two-body relaxation becomes important, the disruption rate is mass- 
dependent, and equation (7) is no longer valid. This occurs for r > 5 x 1O 8 (M/1O 4 M ) yr, 
i.e., well outside the observed mass-age domain of clusters in the Antennae galaxies (see 
Section 4 below). 

When we insert equation (7) into equation (6), we obtain 

POD 

<t>(L) = / ^(r)L} X (THr)dT. (8) 
Jo 

It is helpful at this point to use the mass-to-light ratio fx as a substitute for the age r. With 
this in mind, we introduce the corresponding distribution, defined such that 9(fx)dfi is the 
fraction of clusters with mass-to-light ratios between fx and fx + dfx. This is related to the 
age distribution by 

6(jJ.)=X(T)\dT/dlM\. (9) 

Combining this with equation (8) gives 

POO 

<f>(L) = / (io) 

JO 

This is the simplest form of the general relation between the luminosity, mass, and age (or 
mass-to-light ratio) distributions subject to the two simplifying assumptions embodied in 
equations (4) and (7). To make further progress, we must specify ip(M) and/or 9(fx). 



3. ILLUSTRATIVE EXAMPLES 

We now consider four simple, illustrative examples, based on specific assumptions about 
the distributions of masses and mass-to-light ratios (delta functions and power laws). We 
could extend the list of examples indefinitely, but those presented here are sufficient to 
illustrate our main conclusions. 

Example 1: delta-function distribution of mass-to-light ratios, 9 (fx) = 5 (fx — fx ), and 
arbitrary distribution of masses, xb(M). In this case, all clusters have the same mass-to-light 
ratio fxo and hence the same age To given by fx = fx(ro). The luminosity function, from 
equation (10), is 

(f>(L) = fx ^(fx L). (11) 

Here, (f>(L) and if)(M) are identical apart from a rescaling of their arguments by fiQ. This is 
appropriate for populations of clusters that are observed long after they formed, with typical 
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ages much larger than their age spreads (i.e., r 3> At). This is probably a reasonable 
approximation for old globular clusters. However, it is a poor approximation for populations 
of clusters that are still forming, such as that in the Antennae galaxies, since in this case the 
typical age is comparable to the age spread (i.e., r ~ At). 

Example 2: power-law distribution of mass-to-light ratios, 9(fi) = Afi e , and arbitrary 
distribution of masses, ip(M). According to equation (10), the luminosity function is 



In this case, 4>{L) is a power law with exponent a = — (2 + e) irrespective of the form of 
ip(M). This example illustrates in an extreme way that <f>(L), in general, depends on x( r )i 
and not on ip(M) alone. Equation (12) is valid so long as the integral on the right-hand side 
exists. This condition is satisfied whenever ip{M) is finite over a finite range of M and zero 
elsewhere, as is true for real populations of star clusters. A power-law model of ip(M), if 
extended both to M = and M — > oo, would, however, cause an artificial divergence of the 
integral in equation (12). 

It is worth exploring this example in a little more detail. Stellar population models 
indicate that the mass-to-light ratio evolves approximately as a power law, /i(t) oc t 5 , over 
a wide range of ages, r > few x 10 6 yr (after smoothing over many small wiggles), with 
5 ~ 0.7 for luminosities in the V band, and larger (smaller) exponents for shorter (longer) 
wavelengths (Leitherer et al. 1999; Bruzual & Chariot 2003; see e.g. Fig. 1 of Fall et al. 
2005). Assuming that the age distribution is also a power law, x( r ) ^ t7 j we then have, 
from equations (9) and (12), 6{ji) oc // with e = (7 — 5 + l)/5, and 4>(L) oc L a with 



In this case, the exponent of 4>(L) depends on the exponent of x( T ) an d, except in the special 
case 7 = — 1, on the photometric band (through 5). 

For the young clusters in the Antennae galaxies, equation (13) and the observed age 
distribution, with 7 w — 1, predict a ~ — 1, and hence a luminosity function significantly 
shallower than the observed one, with a ~ —2. Equation (12), therefore, does not explain the 
observed form of 0(L) in the Antennae galaxies. We emphasize that this conclusion depends 
crucially on the observed decline of x( r )- 111 the absence of this information, equation (12) 
would provide an acceptable explanation for the observed form of (f>(L), as the following 
example shows. For 7^0, corresponding to x{ T ) ~ const and hence little if any disruption 
of clusters, equation (13) implies a ~ —2, close to the observed exponent of (f>(L) in many 
galaxies. Thus, without evidence to the contrary, we could speculate that nearly uniform 




(12) 



a — — 



1-(1 + 7 )A 



(13) 
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age distributions, rather than power-law mass functions, are responsible for the observed 
power-law luminosity functions ^ 

Example 3: delta-function distribution of masses, ip(M) = 5(M — M ), and arbitrary 
distribution of mass-to-light ratios, 9(fi). In this case, all clusters have the same mass M , 
and the luminosity function, from equation (10), is 

<f>(L) = M L- 2 6(M /L). (14) 

Here, as in the previous example, <p(L) is determined entirely by 9(fi) and hence by x(t). 
Equation (14) also demonstrates that 4>(L) can be a broad function even when ip(M) has 
no width at all. In particular, if #(//) is a power law with exponent e, then <f>(L) is a power 
law with exponent a = — (2 + e), as expected, because equation (14) is then a special case 
of equation (12). Once again, this is not a viable explanation for the observed luminosity 
function of young clusters in the Antennae galaxies. 

Example 4-' power-law distribution of masses, ip(M) = BM^ , and arbitrary distribution 
of mass-to-light ratios, 9{fj). According to equation (10), the luminosity function is 

POO 

<j>{L) = BL 13 / 9(fi)fi 1+p dfi. (15) 
Jo 

In this case, <p(L) and ip(M) have identical forms; both are power laws with the same expo- 
nent, irrespective of x( r ) an d the band in which luminosities are measured. This probably 
is a good description of the population of young clusters in the Antennae galaxies, for which 
the observed luminosity, mass, and age distributions are approximate power laws with ex- 
ponents a ~ (3 ~ —2 and 7 « — 1, respectively. In this with any power-law model 
of x{ T )i one might wonder whether the integral in equation (15) converges at small //, cor- 
responding to small r. However, in all the photometric bands of interest here (near UV 
through near IR), /x(r) follows a power law down to a finite, minimum value fi m in, which 
occurs at r ~ few x 10 6 yr (Leitherer et al. 1999; Bruzual & Chariot 2003). This truncates 
6>(/i) for /i < yU min and ensures that the integral is finite. 



4. DISCUSSION AND CONCLUSIONS 

We can now answer the question posed in the Introduction: Why does the luminosity 
function of young clusters in the Antennae galaxies have the same, or approximately the 



1 See Hogg & Phinney (1997) for further discussion of this possibility in the context of the luminosity 
function of galaxies. 
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same form as the mass function? As the examples in the previous section show, the answer 
has two parts: (1) because the mass function is a power law, and (2) because it is statistically 
independent of the age distribution. The exact form of x( r ) is immaterial in the relation 
between 4>(L) and if)(M) provided only that it declines steeply enough that Example 2 above 
is irrelevant. The independence of ip(M) and x( r )i however, is a key element. This may 
at first seem like only a mathematical convenience, but in fact it has important physical 
implications, as the following discussion makes clear. 

Star clusters are disrupted by a variety of processes, beginning with the removal of 
interstellar matter by stellar activity (photoionization, winds, jets, supernovae), leading to 
"infant mortality." The later disruptive effects include stellar mass loss, dynamical friction, 
gravitational shocks, and stellar evaporation driven by two-body relaxation (see Spitzer 1987 
for a review). In massive galaxies like the Milky Way and the Antennae, dynamical friction 
and gravitational shocks, while potentially important for some clusters near the centers of 
the galaxies, have relatively little effect on most of the other clusters. Moreover, the mass 
loss by stellar evolution and gravitational shocks, even when these processes are important, 
tends to preserve the shape of the mass function of the clusters; in particular, an initial 
power law remains a power law with the same exponent [see eqs. (12) and (13) of Fall & 
Zhang 2001]. 

The dominant long-term disruptive effect for most clusters is the gradual escape of stars 
resulting from gravitational scattering by other stars within the same clusters ( "evaporation" 
driven by two-body relaxation). We denote the rate of mass loss —dM/dt by this process 
by /i cv (not to be confused with a mass-to-light ratio). For clusters with a constant mean 
internal density, set by the smooth tidal field of their host galaxy, /x ev also remains constant, 
and M decreases linearly with time t. If the clusters of current age r formed at a rate h(r) 
in the past, with an initial mass function ip Q (M), then the current bivariate distribution of 
masses and ages is given by 

g(M,T)=MM + fi cv r)h(r). (16) 

[This is a straightforward generalization of eq. (11) of Fall & Zhang 2001.] Evidently, g(M, r) 
is independent of M for Af C AW 7 ", has a turnover at M ~ fi cv T, and is proportional to 
ipo(M) for M 3> fXevT- This form of g(M, r) is not covered by the examples in the previous 
section because it is not the product of a function of M alone and a function of r alone, 
as in equation (7). We note incidentally that this is true of any disruptive process for 
which the mass-loss rate of a cluster depends on its mass, as in all models of the form 
dM/dt = -M/t d (M) with t d (M) oc M fc , except the special case k — 0. 

It is now interesting to consider the clusters in the Antennae galaxies in the context 
of equation (16). Detailed dynamical calculations for massive galaxies like the Milky Way 
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and the Antennae give fj, ev w 2 x 10 5 M yr 1 and hence a turnover or "peak" in the mass 
function of the clusters at M p « 2 x lO 5 (r/lO lo yr)M (Fall & Zhang 2001). This means that 
the clusters most affected by stellar evaporation have r > M//i ev pa 5 x 10 8 (M/10 4 M o ) yr 
and thus are not included in the empirical determination of g(M,r). We are thus led to 
a two-stage picture for the disruption of the clusters (Fall et al. 2005). In the first stage, 
the energy and momentum input from massive young stars removes the interstellar matter 
from protoclusters, leaving many of them gravitationally unbound ("infant mortality"). The 
empirical evidence is that this process disrupts most of the clusters relatively rapidly but 
largely independently of their masses, thus reducing the amplitude of the mass function while 
preserving its initial power-law shape. Since nearly all the observed clusters are in this phase 
of evolution, the illustrative examples of the previous section apply, as does our explanation 
for the similarity between the luminosity and mass functions. The few clusters that survive 
the first stage are then liable to disruption in the second by stellar evaporation driven by 
two-body relaxation. As noted above, this process eventually changes the shape of the mass 
function into that of old globular clusters. 

How general is this picture? Does it apply to the clusters in all galaxies, or only to those 
in a small subset of galaxies like the Antennae? The initial mass function of star clusters is 
probably similar to the mass spectrum of their parent molecular clouds, and it is possible 
that this is determined by nearly universal, fractal-like density and/or velocity fields in the 
interstellar medium (Elmegreen 2002 and references therein). Moreover, the disruption rates 
in both the first (rapid) and second (slow) stages discussed above depend mainly on the 
properties of the clusters, not those of their host galaxies. Thus, we expect the picture 
outlined here to be fairly general. Nevertheless, it is important to test it observationally in 
detail in at least a few more cases, especially in non-interacting, quiescent galaxies. This 
requires determinations of the mass and age distributions over wide ranges of mass and age, 
which in turn requires deep multiband photometry of large samples of clusters. This should 
be a higher priority than simply determining the luminosity functions of clusters in more 
galaxies. 

Our main conclusion, at least for the young clusters in the Antennae galaxies, is that 
the luminosity function has the same form as the mass function because the latter is a power 
law and is independent of the age distribution. But the main lesson of this paper is that 
disentangling the relations between the luminosity, mass, and age distributions has helped 
to clarify our picture of the formation and disruption of the clusters. 

The author is grateful to Rupali Chandar, Michael Santos, Bradley Whitmore, and the 
referee for helpful comments. 
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